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Abstract 

It is known that the dual of the general adversary bound can be used to build quantum 
query algorithms with optimal complexity. Despite this result, not many quantum algorithms 
have been designed this way. This paper shows another example of such algorithm. 

We use the learning graph technique from [Bell lb] to give a quantum algorithm for k- 
distinctness problem that runs in o(n 3 ^ 4 ) queries, for a fixed k, given some prior knowledge on 
the structure of the input. The best known quantum algorithm for the unconditional problem 
uses 0(n k/{k+1) ) queries. 

1 Introduction 

This paper is a sequel of jBelllb] on applications of span programs, or, more generally, dual of 
the Adversary Bound, for constructing quantum query algorithms for functions with 1-ccrtificate 
complexity bounded by a constant. Also, we use the computational model of a learning graph. 
In the aforementioned paper, a reduction of a learning graph to a quantum query algorithm was 
done using the notion of a span program, another computational model, proven to be equivalent 
to quantum query algorithms in the papers of Reichardt et al. [Reill, LMR +TT] , 

Two questions remained open from the last paper. Firstly, the logarithmic increase in the 
complexity for functions with non-Boolean input; and whether a learning graph that uses values 
of the variables to weight its arcs has more power than the one that doesn't. We fully resolve 
the first concern by switching from span programs to a more general notion of the dual of the 
adversary bound that possesses the same properties, and, thus, getting a query algorithm with the 
same complexity as the learning graph, up to a constant factor. 

For the analysis of the second problem, we have chosen the fc-distinctness problem for k > 2. 
This is the most symmetric problem for which the knowledge of the values of variables can be 
important in construction of the learning graph. Let us define this problem here. 

The element distinctness problem consists in computing the function / : [m] n — > {0, 1} that eval- 
uates to 1 iff there is a pair of equal elements (known as collision) in the input, i.e., f{x\, . . . , x n ) = 1 
iff 3i =/= j : x- L = Xj. The quantum query complexity of the element distinctness problem is well 
understood. It is known to be 0(rt 2 / 3 ), the algorithm given by Ambainis |Amb07j and the lower 
bound shown by Aaronson and Shi [AS04] for the case of large alphabet size Vt{n 2 ) and by Ambai- 
nis |Amb05] in the general case. Even more, the lower bound i7(n 2 / 3 ) holds if one assumes there 
is either no, or exactly one collision in the input. 

The k- distinctness problem is a direct generalization of the element distinctness problem. Given 
the same input, function evaluates to 1 iff there is a set of k input elements that are all equal. 
The situation with the quantum query complexity of the fc-distinctness problem is not so clear. 
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As element distinctness reduces to the ^-distinctness problem by repeating each element k — 1 
times, the lower bound of fl(n 2 ^ 3 ) carries over to the ^-distinctness problem (this argument is 
attributed to Aaronson in |Amb07j ). However, the best known algorithm requires 0{n k '^ k+1 ') 
quantum queries |Amb07| . 

As the above reduction shows, the ^-distinctness problem can only become more difficult as k 
increases. There is another difficulty that arises when k > 2 — this the huge diversity in the inputs. 
For element distinctness, all inputs that are distinct are essentially the same — they are all related 
by an automorphism of the function. Similarly, without loss of generality, one may assume that 
an input which is not distinct has a unique collision, and again all such inputs are related by an 
automorphism. When k > 2 this is no longer the case. For example, for k = 3 inputs can differ in 
the number of unique elements. 

Main theorem In this paper, we show how one can fight the first difficulty, but we ignore the 
second one. Before explaining how we do so, let us give some additional definitions. 

Let (a;i)ie[ n ] be the input variables for the fc-distinctness problem. Assume some subset J C [n] 
is fixed. A subset / C J is called t-subtuple with respect to J if 

Vi, j € I : Xi = Xj , Vi £ I Vj £ J \ I : Xi ^ xj and |/| = t, (I) 

i.e., if it is a maximal subset of equal elements and it has size t. For the important special case 
J = [n], we call them t-tuples. If I is such that only the first condition of ([T]) is satisfied, we call 
it subset of equal elements. 

We give a quantum algorithm for the fc-distinctness problem that runs in o(n 3 / 4 ) queries for 
a fixed k, but with the prior knowledge on the number of t-tuples in the input. Using the same 
reduction as in |Amb07j . it is easy to show the complexity of this problem is Q(n 2 / 3 ) as well. 

Theorem 1. Assume we know the number of t-tuples in the input for the k- distinctness problem 
for all t = 1, . . . , k— 1 with precision 0(^/n). Then, the problem can be solved in 0(jn x ~ 2 ^ 2 
quantum queries. The constant behind the O depends on k, but not on n. 

The precision in the formulation of the theorem can be loosened, 0{\fn) is the most obvious 
value that works for all A;'s. See Section 14.31 for more details. Concerning the complexity of the 
algorithm, it is the exact one, and we do not know whether it can be improved. 

Organization of the Paper The paper is organized as follows. In Sectional we define basic 
notions from quantum query complexity and probability theory. In Section [21 we define learning 
graphs and give a quantum algorithm for computing them. In Section El we develop some tools 
and get ready for Section [SJ where we prove Theorem [1] 

2 Preliminaries 

Let [to] denote the set {I, 2, . . . , to} and consider a function /: [m] n D T> —> {0, 1}. We identify 
the set of input indices of / with [n] . An assignment is a function a : [n] D S — ¥ [to] . One should 
think of this function as fixing values for input variables in S. We say input x = (^i)jg[ n ] agrees 
with assignment a if a{i) = Xi for all i £ S. If S C [n], by xs, we denote the only assignment on 
S that agrees with x. 

An assignment a is called a b-certificate for / if any input from V, consistent with a, is mapped 
to b by /. The certificate complexity C x {f) of function / on input x is defined as the minimal 
size of a certificate for / that agrees with x. The ^-certificate complexity C^ b '(f) is defined as 
max xe/ -i (b ) C x (f). 
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We use [a, b] and }a, b[ to denote closed and open, respectively, intervals of R; R + to denote the 
set of non-negative reals. For the real vector space K. m , we use the ^oo-norm, ||ac||oo = max^ \xi\. 
In particular, we denote the £oo-ball of radius d around x by B(x, d). We use 13(d) to denote the 
ball with center zero and radius d. 

For the complex vector spaces C m , however, we use a more common ^-norm, ||x|| = y/Yli \ x i\ 2 - 



2.1 Adversary bound 

In this paper, we work with query complexity of quantum algorithms, i.e., we measure the com- 
plexity of a problem by the number of queries to the input the best algorithm should make. Query 
complexity provides a lower bound on time complexity. For many algorithms, query complexity 
can be analyzed easier than time complexity For the definition of query complexity and its basic 
properties, a good reference is [BdW02] , 

The adversary bound, originally introduced by Ambainis |Amb02] . is one of the most important 
lower bound techniques for quantum query co mplexity. In fact, a strengthening of the adversary 
bound, known as the general adversary bound IHLS07 , has recently been shown to characterize 
quantum query complexity, up to constant factors [Reilll |LMR + Il] . 

What we actually use in the paper, is the dual of the general adversary bound. It provides 
upper bounds on the quantum query complexity, i.e., quantum query algorithms. Due to the 
same results, it also is tight. Despite this tight equivalence, the actual applications of t his upper 
bound (in the form of span programs) have been limited, mostly, to formulae evaluation |RS] . and, 
recently, linear algebra problems [Bell la] . In [Bell lb) , it was used to give a variant of an optimal 
algorithm for the element distinctness problem, and an algorithm for the triangle problem having 
better complexity than the one known before. In this paper, we provide yet another application. 

The (dual of the) general adversary bound is defined as follows. 

Definition 2. Let /: [m] n — > {0, 1} be a function. 



Adv ( f ) =minimize max >^ 



u 
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subject to 2_, ( u x,j\ u y,j) = 1 whenever f(x) ^ /(?/)■ 

3 

For our application, it will be more convenient to use a different formulation of the objective 
value. 

Claim 3. 



Adv ± (/) =minimize 



(max V] ||w K j|| 2 



max \\u„ "" 

keN \ I se/- 1 (i) f-f " j Iwez-Ho)^. 

subject to (u x j\u y j) = 1 whenever f(x) ^ f(y) 



(3) 



j 



Proof. The objective value in Eq. (J3]) is less than that of Eq. by the inequality of arithmetic and 
geometric means. For the other direction, note that the constraint is invariant under multiplying 
all vectors u x ,j where f(x) — 1 by c and all vectors u Vt j where f(y) = by c _1 . In this way we 
can ensure that the maximum in Eq. @ is the same over / _1 (0) and / _1 (1) and so equal to the 
geometric mean. □ 
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The general adversary bound characterizes bounded-error quantum query complexity. 
Theorem 4 ( [Reilll ILMRJTi] ). Let f be as above. Then Q 1/4 (f) = 6(Adv ± (/)). 

2.2 Martingales and Azuma's Inequality 

We assume the reader is familiar with basic notions of probability theory. In this section, we state 
some concentration results we will need in the proof of Theorem[TJ The results are rather standard, 
can be found, e.g., in [AS08]. 

A martingale is a sequence Xo, . . . , X m of random variables such that E[Xi+i | Ao, . . . , Aj] = 
A,;, for all i's. 

Theorem 5 (Azuma's Inequality). Let = Ao, . . . , A m be a martingale such that | Aj+i — Aj| < 1 
for all i 's. Then 

Pr[A m > \\/m\ < e- x2/2 . 

for all A > 0. 

A standard way of defining martingales, known as Doob martingale process, is as follows. As- 
sume f{y\, . . . , y m ) if a real- valued function, and there is a probability distribution Y on the input 
sequences. The Doob martingale D , . . . , D m is defined as 

Dt = E [f{y') | Vj < i : y' 3 = Vj ] 
y'eY 

that is a random variable dependent on y £ Y. In particular, Dq = E[/] and Z? m = /(y). 
This is a martingale, and Azuma's inequality states f(y) isn't far away from its expectation with 
high probability, if revealing one input variable has little effect on the expectation of the random 
variable. 

3 Learning graphs 
3.1 Definitions 

By Theorem^ to upper bound the quantum query complexity of a function, it suffices to construct 
a feasible solution to Eq. @. Trying to come up with vectors which satisfy all pairwise equality 
constraints, however, can be quite challenging even for simple functions. 

A learning graph, introduced in [Belllbj , is a computational model that aids in the construction 
of such vectors for a function /: [to]" D T> — >• {0, 1} with boolean output. By design, a learning 
graph ensures that the constraint Q is satisfied, allowing one to focus on minimizing the objective 
value. 

Definition 6. A learning graph Q is a directed acyclic connected graph with vertices labeled by 
subsets of [n], the input indices. It has arcs connecting vertices S and S U {j} only, where S C [n] 
and j £ [n] \ S. Each arc e is assigned a weight function w e : [m] s — > M + , where S is the origin of 
e. 

A learning graph can be thought of as modeling the development of one's knowledge about the 
input during a query algorithm. Initially, nothing is known, and this is represented by the root 
labeled by 0. When at a vertex labeled by S C [n], the values of the variables in S have been 
learned. Following an arc e connecting S to S U {j} can be interpreted as querying the value of 
variable Xj. We say the arc loads element j. When talking about vertex labeled by S, we call S 
the set of loaded elements. 
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In order for a learning graph to compute function / correctly, for any x G / _1 (1), there should 
exist a vertex of the learning graph containing a 1-certificate for x. We call vertices containing a 
1-certificate accepting. 

Let e be a weighted arc of the learning graph from S to S U {j}. In the examples of learning 
graphs given in [Belllb] . it sufficed to assign e a weight w e that depended only on the set S and 
element j, but not the values learned. Here, we follow Remark 4 of [B elllbj and use a more general 
model where w e can depend both on S and j, as well as on the values of the variables in S. We 
denote w e (x) = w e {xs). Although, this notation is convenient, it is important to keep in mind 
that values of the variables outside S do not affect the value of w e . The weight of an arc should 
be thought of as the arc is missing for this particular input. 

By G(x), we denote the instance of Q for input x G T>, i.e., Q(x) has the same vertices and arcs 
as Q does, only the weight of arc e is a real number w e = w e {x). Another way to think of a leaning 
graph, is like a collection of graphs Q(x) such that arcs from S to S U {j} in G(x^) and G(x^) 
have equal weight if X Q> X o . The arcs with the latter property are called identical. 

Arcs is the main constituent of the learning graph, and we use notation e G Q to denote that e 
is an arc of Q. Similarly, we write e G Q(x). 

The complexity of a learning graph computing / is defined as the geometrical mean of its 
positive and negative complexities. The negative complexity J\f(Q(y)) for y G / _1 (0) is defined as 
J2eeg(y) We ' The negative complexity of the learning graph A/"(t/) is defined as maxj, e y-i( ) N(G(y))- 
In order to define positive complexity, we need one additional notion. 

Definition 7. The flow on Q(x) for x G / _1 (1) is a real-valued function p e where e G G(x). It has 
to satisfy the following properties: 

• vertex is the only source of the flow, and it has intensity 1. In other words, the sum of p e 
over all e's leaving is I; 

• vertex S is a sink iff it is accepting. That is, if S ^ and S does not contain a 1-certificate 
of x for / then, for vertex S, the sum of p e over all in-coming arcs equals the sum of p e over 
all out-going arcs. 

The complexity of the flow is defined as X)ege(^) Pe/ W ei with convention 0/0 = 0. The positive 
complexity V(G(x)) is defined as the smallest complexity of a flow on G(x). The positive complexity 
of the learning graph V(G) is defined as max xG j-in) V(G(x)). 

We often consider a collection of flows p for all x G / _1 (1). In this case, p e {x) denotes the flow 
p e in G{x). 

Let us briefly introduce some additional concepts connected with learning graphs. The i-th 
step of a learning graph is the set of all arcs ending in a vertex of cardinality i. If E C Q is a set 
of arcs, we use notation pe = J2 e eEPe- Usually, E is a subset of a step. A special case is ps with 
S being a vertex; it is used to denote the flow through vertex S, i.e., the sum of p e over all arcs 
ending at S. 

The following technical result is extracted from [Belllb] 

Lemma 8 (Conditioning). Suppose V is a subset of vertices such that no vertex is a subset 
of another. Let p e be a flow from and ending at V of intensity 1, W be a subset of V and 
t = ^2sew Ps ■ Then there exists a flow p' with the same properties, such that p' s = ps/t for 
S G W andp' s = 0, otherwise. Moreover, the complexity of p' is at most 1/t 2 times the complexity 
ofp- 

This lemma is applied as follows. One uses some construction to get a flow that ends at V. 
After that, another construction is applied to obtain a flow that starts at W and ends at the proper 
sinks, i.e., accepting vertices. In the second flow, vertices in V \ W are dead-ends, i.e., no flow 
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should leave them. Then it is possible to apply Lemma [5] to glue both parts of the flow together 
and get a valid flow. 



3.2 Reduction to Quantum Query Algorithms 

In this section, we prove that if there is a learning graph for / of complexity C then Qi/i{f) = 0(C). 
In the case of / with non-boolean input alphabet this solves an open problem from [Belllb] by 
removing a logarithmic factor present there. We do this by showing how a learning graph can be 
used to construct a solution to Eq. © of the same complexity, and then appealing to Theorem fJJ 

Theorem 9. If there is a learning graph for f: [m] n —¥ {0, 1} with complexity C then Adv ± (/) < 
C. 

Proof. Let Q be the learning graph, w e be the weight function, and p be the optimal flow. 

We show how to construct the vectors u x j satisfying ([3]) from Q. Let Ej be the set of arcs 
e s,Su{j} between S and SU {j} for some S. Notice that the set of {Ej}j & ^ partition all the arcs 
in the graph. If e = es,su{j}i ^ a ( e ) € [m] s be an assignment of values to the set labeling the 
origin of e. 

The vectors u x j will live in a Hilbert space (J) eeB . a / e \ H e ,a{e) where a(e) € [m] s is an as- 
signment of values to the positions in S. In our case each -ff e ,a(e) — C. Thus we think of 
u x,j — (B e g_E a (e) u x,j.e,a(e)> an d now go about designing these vectors. 

First of all, if e = es t su{j} then u x j yt , a ^ = if xs ^ a(e). Otherwise, if f(y) = then we set 
«jj, e ,a(e) = \/we{y) and if f(x) = 1, we set u xJ ^ e ^ e) = p e (x) / y/wjx) . 

Let us check the objective value. If f(y) — then we have 



If f(x) = l then 



E IK;II 2 -EE Mv) = E Mv) = moiv))- 

E ii^-n 2 = E E ^rr = E p 4? = new. 

Thus the geometric mean of these quantities it is at most C . 
Let us now see that the constraint is satisfied. 

y ] ( u x,j\ u y,j) = y ] y ] ( u x,j.e,x s \ u y,j,e,xs) 

j-.Xj^Vj 3-Xj^Vj e S,Su{j} 

xs=VS 

= E E 4==V^) = E E ^) = i. 

j e s,su{ 3 } V W e{ X ) j e SiSU{i} 

xs=ys,xj=£yj x s =ys,xj^Vj 

The second equality from the end holds because w e (x) = w e (xs) = w e (ys) = w e (y) due to the 
construction of the weight function. To see why the last equality holds, note that the set of arcs 
e s,Suj where xs = ys an d Xj ^ yj is the cut induced by the vertex sets {S | xs = ys} and 
{S | xs ys}- Since the source is in the first set, and all the sinks are in the second set, the value 
of the cut is equal to the total flow which is one. □ 



4 Getting Ready 

This section is devoted to the analysis of the applicability of learning graphs for the fc-distinctness 
problem, without constructing the actual learning graph. In Section 14. 1[ we review the tools 
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of [Belllbj to the case when the arcs of the learning graph depend on the values of the variables. 
In Section 14.21 we make the tools of in Section 14.11 easier to apply. In Section 14.31 we describe 
the conventions on the input variables we assume for the rest of the paper. In Section 14.41 we 
develop an important notion of almost symmetric flow that is a generalization of symmetric flow 
used in [Belllbj . Finally, in Section 14.51 we describe a learning graph that is equivalent to the 
previous quantum algorithm for the /c-distinctness problem. 

4.1 Symmetries 

In [Bell lb] , the symmetries under consideration were those of the indices of the input variables. 
This was sufficient because values of the variables did not affect the learning graph. In this paper, 
we consider a wider group of symmetries, namely S n x S m , where S is the full symmetric group, 
that in the first multiplier permutes the indices, and the second one the values of the variables, 
i.e., an input x = (xi) ie [ n ] gets mapped by a = a\ x a Q to ax = {a (x aii )) ie [ n ]. 

Let EC5„x S m be the symmetry group of the problem, i.e., such that /(ex) = f{x) for all 
ieD and u£E. For the fc-distinctness problem, £ equals the whole group S n x iS m . 

We extend the mapping x t— > ax to assignments, as well as vertices and arc of learning graphs in 
an obvious way. For example, an arc e € Q (x) from 5* to S U {v} is mapped to the arc ae € Q {ax) 
from a\S to ai(SU {v}). Actually, graph Q(ax) may also not contain the latter arc. To avoid such 
inconvenience, we assume Q is embedded into the complete graph having all possible arcs of the 
form es^u{v}, with the unused arcs having weights and flow equal to 0. Then, it is easy to see any 
a e £ maps a valid flow on Q (x) to a valid flow on Q (ax) in the sense of Definition [7J Of course, 
the complexity of the latter can be huge, even +oo, because it may have a non-zero flow through 
an arc having weight 0. Consider two arcs: 

e-i € G{x^) originating in Si and loading Vi, for i = 1,2. (4) 

In this section, as well as in Section ^. 21 we are going to define various equivalence relations between 
them, mostly, to avoid the increase in the complexity of the flow under transformations from S. 
Note, in contrary to [Belllbj . we define equivalences between arcs, not transitions, i.e., chains of 
arcs. 

Equivalency Arcs e\ and ei are called equivalent iff there exists a £ £ such that ai(v{) = i>2 
and a(x^~l) — Xg . It is natural to assume equivalent arcs have equal weight. We give a formal 
argument in Proposition 1101 

Denote by £ the set of all equivalency classes of Q under this relation. Also, we use notation 
£i for all equivalency classes of step i (an equivalency class is fully contained in one step, hence, 
this is a valid notion). If E E £, we use notation E(x) to denote the subset of arcs of Q{x) that 
belongs to E. 

For the fc-distinctness, the equivalence is characterized by the structure of the subtuples of 
S. We capture this by the specification (3(S) of the vertex, i.e., by a list of non-negative integers 
(fix, &2j • • • j bk-i) such that S contains exactly b t t-subtuples. (If S contains a k- or a larger subtuple 
it is an accepting vertex and no arcs are leaving it). In particular, IS*! = ^2 t tb t . Thus, two arcs 
are equivalent iff the specifications of their origins are equal. 

Strong equivalency The first part of this section describes the equivalence relation for arcs e\ 
and &2 with respect to their weight. We would like to get a stronger equivalence that captures the 
flow through an arc. This kind of equivalency has already been used in [Belllbj without explicit 
definition. 
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Arcs ei and e^ from Q are called strongly equivalent iff there exists an element a G E such 
that 

a (xW) =x { - 2 \ <7i(Si) = S 2 and oi(«i) = wa. (5) 

Again, due to symmetry, it is natural to assume the flow through strongly equivalent arcs is 
equal. If, for some positive inputs x^ and x^ 2 \ there is a G E such that the first condition of (|5|) 
holds, the task of finding a flow for x^ is reduced to finding a flow for a;*- 1 ) , that is again a corollary 
of Proposition [TU] See also Proposition [11] 



Formal argument We give a proof that, without loss in complexity, we may assume weight and 
flow is constant on equivalent and strongly equivalent arcs, respectively. 

Proposition 10. For any learning graph Q , it is possible to construct a learning graph Q' and a 
flow p' on it with the same or smaller complexity, so that equivalent arcs have the same weight and 
strongly equivalent arcs have the same flow through them. 

Proof. The proof is a standard application of symmetry. Let p be an optimal flow for Q . We define 
the weights of arcs in Q' and the flow through it as follows: 

w' e ( a ) = |4r E w °e(< 7a ), and Pe( x ) = j^j E Voe{ox). 

If arcs of (0} are equivalent, there exists a' such that er'ei = e-i and a' '{x^) — x^\ Hence, 

<M 2) ) 4E 4E ^'ex(^(4?)) = <(* (1) ). 

since E is a group. The equality of flows is proven in a same way. Let us check the complexity. 
For a negative input y, we have: 

W(y)) = E m E w ~ipv) = 4 E Wfa))- 

Hence, for at least one a, N{G'(y)) < Af{G{cry)). Thus, N{G') < Af(G). 

For the positive case, at first note that p' is a valid flow, as a convex combination of valid flows. 
For any x G we have 

v(g'(x)) < E E p°e(™)) (w\ S ^c^ol 



^ E ]e[ E = ]s[ E m«0). 



The second inequality follows from the Jensen's inequality for the square function (X^o-es 1<j z o) 2 ^ 
So- e s7er^, with 7 CT = w ae {ax)l (2 CTfES w o-e (era)) and z a = p ae (ax)/^ a . Due to the same argu- 
ment, T(G') < V{G). □ 



4.2 Loosening equivalencies 

Although equivalencies defined in the previous section are optimal, they are not always convenient 
to work with. They turn out to be too strong, that results in a vast number of equivalency classes 
that should be treated separately. In this section, we describe a number of ways to loosen these 
equivalences, thus reducing the number of classes and making them easier to work with. 
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Equivalency Assume the weight function is decomposed as w e (a) — w e (9(a)), where is some 
"filter" that captures the properties of a we are interested in. It is good to assume symmetry 
preserves 9, i.e., 9{a\) — 6(0.2) implies 9(aa\) = 9(aot2) for any a G E. Transitions e\ and e2 are 
called 9-equivalent iff there exists a G £ such that o\(v\) = vi and a(9(x^)) — 9(x^). It is again 
natural to assume 0-equivalent arcs have the same weight. Two main examples are: 

01, the identity. This results in the relation from the previous section. This is the main equiva- 
lency used in Section [5] 

02, mapping a to its domain T>(a). For the fc-distinctness, arcs e\ and e2 from (|4]) are 02- 
equivalent iff |Si| = \Sz\. This is the equivalency used in [Bell lb] . 

Strong equivalency Unlike equivalency, strong equivalency turns out to be too strong for all 
our applications. We can weaken it by considering Q as a learning graph for function / that gets 
as input x, the original input x with some information removed. 

More precisely, extend the output alphabet [to] with a set of special characters Q. Let $ : / _1 (1) — > 
([to] U Q) n be the function that maps x to x. We extend elements of S to ([to] U Q) n by assuming 
cr (c) = c, if <7j x a G £ and c € Q. 

The function /: ([to] U Q)" -> {0,1} is defined as f(d(x)) = 1 for all x G / _1 (1). Let £ be 
the same learning graph as Q but calculating /. Arcs e\ and ei from are called "d-strongly 
equivalent iff the corresponding arcs e\ G ^(^(a^ 1 ))) and §2 G t/(^(a;' 2 ' ) )) are strongly equivalent. 

For this construction to work, wc require a stronger definition of a 1-ccrtificatc for /. We say 
an assignment a: [n] D M — > [to] U Q is a 1-certificate if, for all x G / _1 (1) such that i?(a;)M = a, 
Xm is a 1-certificate of x in /. With this definition, any valid flow in Q{&(x)) is simultaneously a 
valid flow in Q(x). 

We give three examples of i?'s. 

t?i, the identity. This results in the relation from the previous section. We have no example of 
using this equivalency; 

i?2j the equivalency used in [Bell lb] . Let Q — {-,*}■ For a positive input x, fix some 1-certificate 
a. Let M be the domain of a. The elements of M are called marked. Define x = v } 2(x) as 




7k-, i G M; 
•, otherwise. 



Clearly, iJjf^M is a 1-certificate. Refer to Section H31 for an example of usage of this 
equivalency 

$3, defined in Section [4.31 The main equivalency used in Section [5] 

Again, we assume the flow through "^-strongly equivalent arcs is equal. The equivalencies we 
use in the paper possess two additional symmetric properties. Firstly, if x^\x^ 2 ' G / _1 (1) and 
a G X are such that a(i9(x^ 1 ' > )) = i9(x^) then, for any 1-certificate a of t9(a;W), act is a 1-certificate 
oftf^ 2 )). Secondly, any t9-strong equivalency class, having non-zero flow through it, is completely 
contained in some 0-equivalency class. 

Proposition 11. If 9, x^\ x^ and a flow p e on Gi^ix^)) satisfy the above conditions and 
9-equivalent arcs in Q have the same weight, then o~p e is a valid flow on G(i9(x^)) with the same 
complexity (that, also, is a valid flow on Q(x^ 2 ')). 
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tf 2 ^ — tf 3 ^ — t?l 



Figure 1: Relations between equivalencies. Arrows are from a more strong relation to a weaker one. 
Id is the identity relation from Section 13.11 #'s are equivalencies and i?'s are strong equivalencies. 
Implication from $3 to 0% only holds for arcs with non-zero flow. 



4.3 Conventions for ^-distinctness 

Strong equivalency, as defined in Section 14.11 turns out to be too strong for the fc-distinctness 
problem, because for most of the pairs x^^x^ € / _1 (1) there is no a such that c^ 1 )) = x^ 2 \ 
and, hence, no arcs from Q(x^) and G(x^) can be strongly equivalent, whatever Q is. We use 
the loosening tool of Section |4~^1 to define $3 so that there always exists a that maps ^(x' 1 ') to 
$3(a^ 2 ')- Then, by Proposition !!!! defining a flow for any positive input x is enough to get a flow 
for all positive inputs. 

Let the set of special characters be Q — {■}. Fix an arbitrary positive input x. First of all, we 
identify a subset M of fc equal elements (the marked elements in terminology of $2)- Next, due to 
the condition in Theorem [TJ we may assume there are non- negative integers l\, . . . , ik-i such that 
in any valid input (either positive, or negative) there are at least It i-tuples and 

fc-i 



We arbitrary select £ t i-tuples. Denote by A t the union of the selected i-tuples. We also use 



notation A> t to denote [J^ = l Aj. We define x = $3(2;) as 



. _ J x it i£ A>i U M; 
otherwise. 

Clearly, assignment xm is a 1-certificate. In the learning graph for /, defined using #3, we will 
have p e (x) = if the origin of e has at least one • in $3(2;). This convention assures that 6\ and 
#3 satisfy the conditions of Proposition [TT] Further, we are going to ignore vertices having -'s in 
them. Figure [T] describes which of the defined equivalence relations imply which. 
Let us use this spot to mention one more convention on the input. Namely, 

Vt<k-l:£ t = n(n). (6) 

Any other case can be reduced to this one by extending the input by n i-tuples with elements 
outside the range of the original problem, for each t < k — 1. 

The strong equivalency class (with respect to $3) of an arc depends solely on the types of its 
initial and target vertices. The type /3(S) of vertex S is an (fc — 1) x fc-matrix (&{,«), where bt tS 
is the number of i-subtuples of S contained in A s (or M, if s = fc). Most of the time, we will 
implicitly assume bt : k — for all t's, hence, describe the type of a vertex by an (fc — 1) x (fc — 1)- 
matrix, assuming the removed row contains only zeroes. Note also that the specification (& t ) can 
be expressed using the type: bt = J2 s =i ^M- 

We will have to measure distance between types. When doing so, we treat them as vectors. 
I.e., the distance between types f3(S) = (b Syt ) and f3(S') = (b' st ) is defined as 

\\P(S)-P(S')\\ 00 =max\b s , t -b' Sit \. 

S.t 
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Negative complexity Here we estimate how the restriction from the actual number of t-tuples 
in the input to it ones in A>± affects the negative complexity. 

Lemma 12. Consider a set A / >1 that is defined similarly to A>i, only it has £' t t-tuples. Assume 
\@t ~ i't\ < d = °( n ) f or a M t' s - Let (bt) be any specification such that b = ^~J t &t = °( n )- Then the 
ratio of the number of subsets satisfying (bt) in A>\ and A' >1 is at most e°( ' n K 

Proof. It is straight- forward to calculate the number of subsets of A>i satisfying specification (bt). 
Indeed, it equals 



k-l 

En 



4 \ f s\ bl s (l s - h, s \ ( s\ 2 ' s (l s - bi lS b s -i, s \ ( 8 s 1 



frl.s/V 1 / V & 2.s / V 2 / V & 



(7) 



where the summation is over all types (bt >s ) that agree with specification (bt). Eq. (J7J, with £ t 
replaced by £' t , gives the corresponding number of subsets in A' >1 . It is enough to show that each 
multiplier featuring l s in ([7]) changes by at most a factor of e °( db / n ) . But we have: 

r --^-^--^)f-^-^--^)-(i +0 ©) ow -^». 

where we used that i s = 6(n), because of ©■ □ 

Since the complexity mentioned in Theorem[T]is o(n 3 / 4 ), it is natural to assume no vertex of the 
learning graph has more elements. It's actually the described in Section [SJ The precision 

0(tfn) in the formulation of Theorem [T] has been chosen so that restriction of the flow to A>i UM 
does not hurt the negative complexity, as it can be seen from the next 

Corollary 13. Fix any possible negative input y, and any valid specification (bt) with all entries 
o(n 3 / 4 ). Then, the number of subsets of [n] satisfying (bt) is bounded by a constant times the 
number of such subsets included in A>\. 

Because of this, we may act as if the set of input variables is A>i U M, not [n]. 



4.4 Almost symmetric flows 

Assume the following scenario. We have chosen which equivalency classes will be present in the 
learning graph. Also, for each positive input, we have constructed a flow. The task is to weight the 
arcs of the learning graph to minimize its complexity. In this section, we define a way of performing 
this task, if the flow satisfies some requirements. 

For the fc-distinctness problem, all arcs leaving a vertex are equivalent, hence, to specify which 
equivalency classes are present, it is enough to define which vertices have arcs leaving them. For 
each step, we define a set of valid specifications. If a vertex before the step satisfies one of them, 
we draw all possible arcs out of it. Otherwise, we declare it a dead-end and draw no arcs out of it. 

The flow is called symmetric in [Be lllb] if, for each equivalency class, the flow through an arc 
of it is either 0, or p, where p does not depend on the input, but may depend on the equivalency 
class; also it is required that the number of arcs having flow p does not depend on the input as 
well. This notion was sufficient for the applications in that paper, because f^-strong equivalence 
was used, and that is easy to handle. In this paper, we use ^3-strong equivalence, and it is not 
enough with symmetric flows. Thus, we have to generalize this notion. 
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Definition 14. The flow is called almost symmetric if, for each equivalency class E, there exist 
constants tt(E) and t{E) such that, for each positive input x, there exists a subset G(E, x) C E(x) 
such that 

t(E)\G(E,x)\ = &( max \E(y)\ \ , ]T Pe (x) 2 =tt (V » e (x) 2 ) 

and Ve e G(J5,x) : p e (x) = 6 (tt (£?)). 

The elements inside G(E, x) are called typical arcs. Number t(_E) is called the speciality of the 
equivalency class (as well, as of any arc in the class). We also define the typical flow through E as 
fi(E) = n(E) max xe j-i(x) \G(E, x)\. It is straight-forward to check that 

Vxef- 1 (l): f i(E) = 0( PE (x)). (9) 

Theorem 15. If the flow is almost symmetric, the learning graph can be weighted so that its 
complexity becomes O [J2e<e£ l I ( E )\n{E) 



Proof. For each arc e in an equivalency class E, we assign weight w e = tt(E) j \Jr(E) . Let us 
calculate the complexity. For each y E / _1 (Q), we have the following negative complexity 

£ w E \E(y)\ = J2 J 7=\ E (V)\ = ° \ E < E )VW) max \G(E,x)\) . 

EES EES V T y- tj > \Ee£ ^ ( ' ) 

For a positive input x £ / _1 (1), we have 

E^: E p^) 2 = o(Y,^\G(E, x )HEA=o(^mV^)- 

EES E eEE(x) \Ee£ 1 > J \Ee£ / 

By combining both estimates, we get the statement of the theorem. □ 

For each step i, define Tj = max^gf; t(E). Then Theorem [TS] together with © and the 
observation that the total flow through all arcs on any step is at most 1, implies the following 

Corollary 16. If the flow is almost symmetric, the learning graph can be weighted so that its 
complexity becomes O Vm) where the sum is over all steps. 



4.5 Previous Algorithm for ^-distinctness 

As an example of application of Corollary 1161 we briefly describe a variant of a learning graph 
for the /c-distinctness problem. It is a direct analog of an algorithm from [Amb07] using learning 
graphs and a straightforward generalization of the learning graph for element distinctness from 
[Bell lb] . 

To define equivalencies between arcs, we use 62 and $2 from Section 1431 The learning graph 
consists of loading r + k elements without any restrictions (as imposed by 62), where r is some 
parameter to be specified later. We refer to the first r steps as to the first stage, and to the last k 
steps as to the second stage. 

Clearly, all arcs of the same step are equivalent. Consider strong equivalency. Let 1 be a 
positive input and let M be a subset of k equal elements in it. We use M as the set of marked 
elements to define $2- Then, the strong equivalence class of an arc is determined by the number 
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of elements in its origin, the number of marked elements among them, and whether the clement 
being loaded is marked. 

The flow is organized as follows. On the first stage, only arcs without marked elements are 
used. On the second stage, only arcs loading marked elements are used. Thus, on each step only 
one strong equivalency class is used, and the flow among all arcs in it is equal. 

It is easy to check this is a valid flow for fc-distinctness and it is symmetric. Let us calculate 
the specialities. The first r steps have speciality O(l). The speciality of the i-th step of the second 
stage is 0(n l /r 1 " 1 ). This is because the fraction of (r + i— l)-subsets of [n] containing i — 1 marked 
elements is ©(r 2 " 1 /^ 1-1 ); and k — i + 1 arc only, out of O(n) originating in such vertex, is used by 

the flow. Hence, by Corollary [THl the complexity of the learning graph is O (r + ^/n fc /r fe_1 ^ that 
is optimized when r = n k ^ k+1 ^ and the optimal value is O (n fc /( fc+1 )). 

5 Algorithm for /^-distinctness 

The purpose of this section is to prove Theorem Q] In Section 1531 we give some intuition behind 
the learning graph. In Section [5. 2[ we describe the learning graph, or, more precisely, define valid 
specifications for each step, as described in Section 14.31 In Section 15.31 we define the flow, and 
give preliminary estimates of the complexity. Finally, in Section [5.41 we prove the flow defined in 
Section [5.31 is almost symmetric and prove the estimates therein are correct. 

5.1 Intuition behind the algorithm 

There is another way to analyze the complexity of the learning graph in Section 14.51 

Lemma 17. Assume convention (0) on the input. The expected number of t-subtuples in an 
r-subset of A>\, chosen uniformly at random, is 0(r*/n t_1 ). 

Proof. Let S be the random subset. Denote n' = |A>i|. The probability a fixed subset of t equal 
elements from A s forms a i-subtuple in S is (™ r t s )/(" ) = 0(r*/« 4 )- The number of such subsets 
is X)s^s(t) = ®{ n )- Hence, by linearity of expectation, the expected number is 0(r'/« t_1 )- □ 

Consider the following informal argument. Let M be the set of marked elements as in Sec- 
tion 14.51 Before the last step, the flow only goes through vertices S having \S PI M\ = k — 1. Fix 
a vertex S and let M' = M PI S. One may say, M' as a (k — l)-subtuple, is hidden among other 
(k — l)-subtuples of S. The expected number of such is 0(r fc_1 /n fc_2 ), total number of (k — 1)- 
tuples is O(n), hence, the fraction of the vertices used by the flow on this step is 0(r fc ~ 1 /n fe ~ 1 ). 
Thus, the speciality of the arc loading the missing marked element is Q(n k /r fc_1 ) that equals the 
estimate in Section l4~5l 

As such, this is just a more difficult and less strict analysis of the learning graph. But one can 
see that the speciality of the last steps depends on the number of i-subtuples in the vertices. We 
cannot get a large quantity of them by loading elements blindly without restrictions, but it is quite 
possible, we can deliberately enrich vertices of the learning graph in large subtuples by gradually 
filtering out vertices containing a small number of them. 

5.2 Description of the Learning graph 

We would like to apply Corollary [121 hence, it is enough to give valid specifications for each step. 
We do this using a pseudo-code notation in Algorithm [T] 

Here ri, . . . , r^-i are some parameters with r.; + i = o(ri), r% = o(n) and r^-i = w(l) to be 
specified later. Also, it will be convenient to denote rg = n. The commands of the algorithm 
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Algorithm 1 Learning graph for the ^-distinctness problem 

1: for j <— 1 to r\ do 
2: Load an element 

3: end for 

4: Declare as dead-ends vertices having more than ctr*/n* _1 i-subtuples for any t = 2, . . . , k — 1 

5: for i <— 2 to k — 1 do 

6: for j 4— 1 to rj do 

7: for Z <— 1 to i do 

8: Load an element of level Z 

9: end for 

10: end for 

11: end for 

12: Load an element / / The last element is no subject to any constraints 



define the specifications as follows. The loop in lines [T] — [3] says there is no constraint on the first 
n steps. Line 0] introduces the original specifications. Here, Ct > are some constants we specify 
later. 

The loop in Lines I5T- fTTI describes how the specifications change with each step. Assume a step, 
described on Line [51 loads an element of level I. Then, a valid specification (bt) before the step is 
transformed into a valid specification (b' t ) after the step as follows 

(b t + l, t = l; 
b' t =lb t -l, t = l-l; 
[&t, otherwise. 

In other words, if there is an arc between vertices of specifications (b t ) and (b' t ) and it load v then 
there exists an (Z — l)-subtuple Q of S such that Q U {v} is an Z-subtuple of S U {v}. In fact, only 
such arcs will be used by the flow, as it is described in more detail in Section [5.31 

Hence, for each specification in Lines I5T- it is possible to trace it back to its original speci- 
fication. For example, if (bt) is a specification of the vertex after step in Line [SJ with the values of 
the loop counters i, j and Z, the original specification is given by (bt) — (S l t ), where 

(bt-r t , 2<t<i; 
bt = I bt-j + l, t = i; 

\b t , otherwise; 

Moreover, the use of the arcs in the flow, as described in the previous paragraph, implies the flow 
through all vertices having some fixed original specification is the same for all steps. 

Finally, the step on Line [T^] loads the last element, and there is no need for the dead-end 
conditions, because after the last step all vertices have no arcs leaving them. 

Naming convention We use the following convention to name the steps of the learning graph. 
The step on Line [5] is referred as the j-th step of the first stage. The step on Line [5] is referred 
using triple (i,j,l), except for the case i = k— 1 and j = r^-i. The latter together with the step 
on line Line [12] is referred as the steps 1,2, ... ,k of the last stage. The steps of the form (i, ■, ■) are 
called the i-th stage. Altogether, all steps of the form (•, •, •) are called the preparatory phase. 



and S t = 



1, t = l- 

0, otherwise. 
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5.3 Flow 



We two possible ways to define a flow. The first one is to set the flow through the arcs on each 
step so that the flow through all vertices on each step is the same. We believe this can be done, 
but we lack techniques to deal with this kind of arguments. 

Instead of that, we select the second way. For each vertex, we divide the flow evenly among all 
possible arcs. Because of this, the ratio of the maximal and the minimal flow accumulates with each 
step, and at the end it is quite large. We avoid this complication by applying the concentration 
results stating that for large n's almost all flow will be concentrated on some typical subset of arcs 
and will be distributed almost evenly on it. 

First Stage For the first stage, we use #2- and ^2-based equivalencies, akin to the first stage of 
the flow in Section [4.5l Consider the uniform flow, i.e., such that distributes all the in-coming flow 
among all out-going arcs, leading to an element of A>i, equally. Clearly, it is symmetric, and the 

flow through any vertex S C A>\ after the first stage is ('^r 1 ') • The speciality of each step in 
this flow is O(l) because of Corollary fl3l 

But this flow is non-zero for vertices declared as dead-ends in Line0] We fix this by applying 
Lemma [5] We have to choose ct > so that, with probability, say, 1/2, an uniformly picked subset 
of size r\ satisfies a valid specification. And it is possible to do so due to Lemma [T7] and Markov's 
inequality. 

After performing the conditioning, the complexity of the flow in the first stage increases by at 
most a constant factor (that can be ignored), and all non-dead-end vertices have the same flow 
through them, we denote p . 

Preliminary Estimates For the remaining stages, we use 6\ and $3 to define equivalences 
between arcs. Here we informally analyze the flow for Lines I5T— H2l of Algorithm [TJ assuming there 
is flow p through all non-dead-end vertices after Line|H The formal analysis is done in Section IBT41 
Roughly speaking, the flow is organized as follows. On step an element, not in M, 

belonging to level I is loaded. On any step of the last stage, an element of M is loaded. Let 
us estimate the complexity of the learning graph. Assume for the moment the flow is almost 
symmetric. 

Approximately n arcs are leaving a vertex on each step. Let (i, j, I) be a step of the preparatory 
phase and assume I > 1. An element of level I is loaded, and there are f2(n-i) (I — l)-subtuples in 
the vertex that can be extended. Hence, f2(n_i) arcs leaving the vertex can be used by the flow. 
This makes the speciality of the step equal to 0(n/r;_i). This is true for I = 1 as well, because of 
the convention ro = n. 

Now turn to the last stage. Let us calculate the speciality of a vertex used by the flow on step 
j > 1 of the last stage. Let Vb be the vertices contained in A>\ having a valid specification, and 
Vm be the vertices of A>\ U M that can be used by the flow. Define relation tp, where Sq £ Vb is 
in relation with Sm £ Vm if Sm can be obtained from Sq by removing one of its (j — l)-subtuples 
and adding j — 1 elements from M instead. Each Sq has f2(ry_i) images and each Sm has 0(n) 
preimages. Hence, |Vo|/|Vm| = 0{ n l r j-\)- Because only 0(1), out of 6(n) arcs leaving a vertex 
from Vm, can be used by the flow, we have the speciality of step j of the last stage equal to 
0(n 2 /rj-i). This also is true for j — 1. All this is summarized in Table [TJ 

If we could apply Corollary \W[ we would get the complexity 
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Step 


First stage Preparatory, (•, -,l) Last stage, j-th 


Speciality 
Number 


1 n/ri-i n A /rj-\ 

n n l 



Table 1: Parameters (up to a constant factor) of the stages of the learning graph for the k- 
distinctness problem. 



Denote pi = log ra and assume all the addends are equal. Then 

1 , gj-i 1 , Pi ■ 1 u t 

2+ = 2 ~ 2"' » = !,•••>«- 1 

where we assume po = 1 an d Pfc = 1/2. It is equivalent to — = — p%)/2. Hence, 

1/2 = po - pfc = (2 k - l)(p k -i - Pk)- 

Thus, the optimal choice of py is 1 - 2 k ~ 2 /(2 k - 1). 

We use these calculations to make our choice of = n Pi . It remains to strictly define the flow, 
prove it is almost symmetric and the estimates in Table Q] are correct. Before doing so, we combine 
some estimates on the values of r% 's in the following 

Proposition 18. We have y/rxr% = o(n) and yjv\ = o(rj) for any i. Also, any valid specification 
on stage i, has ®(rj) j-subtuples for j < i. 

Proof. The first equation follows from pi < 3/4 and P2 < 5/8. The second inequality follows from 
Pi > 1/2 for all i's. 

Due to Line S] of the algorithm, after the first stage, any valid specification has 0{r\jn l ~ ls ) 
i-subtuples. For % > 1, it is o(y/n) = o{r,j) for any j. Hence, after the first stage there are 0(ri) 
1-subtuples, and this number does not substantially change after that. Similarly, if one doesn't 
take into account the ±l-fluctuations, the number of j-subtuples is changed only on stage j, when 
rj j-subtuples are added. □ 

Values of the flow Let us describe how the flow is defined. Fix some stage i. A vertex before 
a step of the form (i, •, 1) is called a key vertex. Consider a key vertex S with type (&{,«)• The flow 
from S is distributed evenly among all succeeding key vertices, where 5" is a succeeding key vertex 
for S iff S' \ S is a subset of equal elements having a value different from any element of S. The 
number of succeding key vertices for S is 



where 



N(S) = Di (E t=1 & M,---,E t=1 ^-i) 



fc-i 

Di{z x , Zk-i) = (4 - Zs) 



s—i 



is the number of possible i-subtuples to extend the vertex with, when z s s-tuples have already been 
used. 

More precisely, let e be an arc of step (i,j, I) originating in a non-dead-end vertex S' and loading 
an element v. Then the flow through this arc is defined using the values of the flow through key 
vertices before step 1) as follows: 

s\ p S '\q \Q\ = I and s > i, where Q = {l £ S'U{v} \ x L = x v } 

Pe= { V IN(S'\Q) ' an( i s i s sucn t ]j at q i s contained in A s ; (10) 

otherwise. 
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If the first case in (fTU|) holds, vertex S' \ Q is called the key vertex preceeding arc e. Note that it 
is uniquely defined. 

5.4 Analysis of the flow 

Typical vertices The point of this section is to prove the flow defined in Section lijUl is almost 
symmetric. For this, we should identify the set of typical arcs. Before doing so, we define typical 
vertices. 

Let j3 = (b t ) be a valid specification of the preparatory phase. Select any t G [k — 1] and let X t 
be the collection of all subsets of A>i consisting of bt t-subtuples. In other words, elements of X t 
satisfy specification (0, . . . , 0, bt, . . . , 0). Denote e t , s — Esex t [°t,s(S)], where b tyS (S) = \S fl A s \/t 
is the element of (3(S). Denote eg = (e tiS ). 

A type (bt, s ), consistent with (3, is called typical if it is inside B(ep, Cy^), where C is a constant 
to be specified later, i.e., if for all t and s holds \bt, s — e f , s | < C^fr~\. A typical vertex is one of a 
typical type. Let us state some properties of the typical vertices. 

Lemma 19. Let (b t . s ) be the type of any typical vertex on the i-th stage. Then, for all t < i and 
s > t, we have 6t jS = Sl(r t ). 

Proof. Since yjr\ = o(r t ), it is enough to show that e t . s — Q(r t ). Let S be an element of X t . 
Arbitrarily order its subtuples: S — {s±, . . . , Sb t }. Clearly, the expectation is the same for ordered 
and unordered lists of subtuples, so let us consider the former. 

By linearity of expectation, e t ,s = &tPr[si C A s ]. The number of sequences having si in A s 
is ^s(j) times the number of ways to pick the remaining bt — 1 t-subtuples out of A>\ where one 
s-tuple cannot be used. By and Lemma [T21 these numbers are equal for different s > t, up to 
a constant factor. Hence, the probability is 0(1), and since b t = Q,(rt), we have et jS = £l(r t ). □ 

Lemma 20. For any valid specification /3 of the preparatory phase and for any X > Cyjr\, 

Pr[||/3(5)-^|| 00 >A]<e-^ A2 /'- 1 ), (11) 

where the probability is uniform over all subsets S of A>\ satisfying (3. 

We derive the lemma from the following two pure technical results 

Proposition 21. Let H be the disjoint union of [Ht)te[k] where each H t is a rectangular array 
of dots, having £ t columns and m t rows. Let X be the set of all r-element subsets of H where no 
subset has more than 1 dot from any column of any H t . Occupy X with the uniform probability 
distribution and let h t : X 3 S \S H H t \. Assume k = 0(1), it — 0(n) and r — o(n). Then: 

Pr[\h t -E[ht}\ > A] <e- n ^/ r \ (12) 

for any A > 0. 

Proof. This is a standard application of Azuma's inequality (Theorem [5]) . Suppose, we sort the 
elements of each S G X in any order: S = {s\, . . . , s r }. Clearly, the probability equals for unsorted 
and for sorted lists. We use both interchangeably in the proof. 

Let Di be the Doob martingale with respect to this sequence. We have to prove that \Di — 
Di-i | = O(l), i.e., the expectation of ht does not change much when a new element of the sequence 
is revealed. To simplify notations, we prove only \D\ — D$\ = O(l), the remaining inequalities 
being similar. 

For the proof, we define two other classes of probability distributions, all being uniform: 
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• Yi. r-subsets of H \ Q, where Q is a fixed column of Hi 

• Zi. (r - l)-subsets o£H\Q. 



For the martingale, it is enough to prove that, for all i: 

\E[ht \X]-(E[h t \Z l ] + 8 l , t )\ = 0{l), 
where 6i t t is the Kronecker delta. We have 

E[h | X] = Pr[Q n S ? 0] {E[h t \ Z t ] + 5 i>t ) + Pr[Q n S = 0] E[h t | Yi]. (13) 
Hence, it is enough to prove that 

\E\ht\Zi\-E\ht 1^)1 = 0(1). (14) 

Denote i'j = £j — Sij; and let Kh and K' h be the number of elements S of Zi and li, respectively, 
having h t (S) = h. Note that K' h = ^y^K^-i, where h g [r] and 7^ = ((£[ — h + l)m t )/h. Thus 

m 1 Yi\ < = ^ hlh ^ h - 1 < ^r 1 ^ - 1 = i + e[^ 1 zj, 

where the second inequality holds because jh monotonely decreases. Thus, by linearity of expec- 
tation, E[ht I Yi] > E[ht I Zi] — k + 1, for all i, thus proving (TT4")) . An application of Azuma's 
inequality finishes the proof of the proposition. □ 

Lemma 22. Assume i,j £ [k — 1], at least one of them is not 1, andm = 0(1) is an integer. Let /i 
be a probability distribution on W 71 such that /i(R m \B(\)) < e~ ClX l Ti for any A > Ci^frl. Assume 
w is a positive real function, defined on the support of \x, such that w(x)/w(y) < e C3rj ^ x ~ v ^°°/ n 
for any x,y. Here C\, O2, O3 are some positive constants. Then there exists a constant C > such 
that 

I w{x)dn{x) = e - n( - x ^ r ^ [ w(x)dfi(x) 

for any A > C^r\. 



Proof. In the proof, C with a subindex denotes a positive constant that may depend on other O's. 

Let v be a measure on ]02y / r7, +oo[ such that j/(]A, +oo[) = fi(W n \ B(X)). The worst case, 
when the mass of p, is as far from the origin as possible, is when f(]A, +00 [) = e~ ClX l Ti . In this 
case, v{t) = g(t) dt with g(t) = 2£Ll e -c 1 t 2 /n, 

There exists a point y in the support of \i such that \\y\\<x> < Ci\Jri. Without loss of generality, 
we may assume w(y) = 1. Consider 



D = / w(x) dfi{x) > ii{B{C 2 ^)) inf w(x) > (1 - e -CiC 2 y-2C 2 C 3 r js /n/n _ 

Then, for any A > Ci^Jrl, 



If 1 



D V» m \0(A) D 



+00 



w(x)d f i{x) < - I e C3r ^ t+C2 ^ )/n g(t)dt 

(15) 

+°°C 4 t ( n r jy /fi rjt r t 2 \ rU 

exp C 5 ^-^ h C 3 — Ci— dt. 

r, \ n n ri J 
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Denote t = tj \ff\. Then the expression in the last exponent can be rewritten as 



3/2 



r ■ r, 



n n Ti n n n \ Wi n^Jr\ 

The coefficients of the last polynomial can be estimated as follows: 

■ < Vr^ 1 = (){]) and ^<V^i = 0(1)) 



nr\ n «V r i 

by Proposition IT51 This means there exist C^ : Cj > such that, for any A > Cq^/ti, the right 
hand side of (fT5|) is at most 

+ °° ^ P ~C 7 t 2 /n rf/ _ _^L p -C 7 A 2 /n _ -n(A 2 /n) 
2C 7 

if A > Cy 7 ?! for C large enough. □ 

Proof of Lemma^Ui Let 5 be the random subset. Denote the set of i-subtuplcs of S by S t . We 
apply Proposition [5T] to St with k — 1 i?t's given by m s = (*) and r = b t = 0(r t ). Thus, if 5 t had 
uniform distribution, Eq. (|12j) would hold, that would imply (fTTj) . because there are O(l) possible 
choices of s and i. 

But in S, St does not have uniform distribution. Each St is assigned weight ws t that is 
proportional to the number of subsets of A' >1 having specification (b' t ), where A' >1 has tj — hj(St) 
j-tuples in the notations of Proposition [51] b' t — and b'j = bj for j ^ t. 

Take two St and S' t , and assume \\/3(St) — /3(S' t ')|| 00 < d. We apply Lemma [T2"l There are 
two cases. If t > 1, the lemma implies ws t /ws' = e°^ dri ^ n \ If f = 1 then ws t /ws> — e°( dr2 ' n \ 
Anyway, either r in (|12jl . or r in the estimation oiws t /ws* is not ri, and, hence, Lemma l2"2l applies . 
finishing the proof of the lemma. □ 



Divergence in the flow After we have defined typical vertices, we are going to show that almost 
all flow goes through them. But before we do so, we show get an estimate of the divergence of the 
flow in the distance of the types. 

Lemma 23. Suppose two key vertices S and S' of the same specification satisfy \\/3(S) — f3(S')\\oa < 
d. Thenps/ps> =e°( dr2 / n K 

Proof. Denote (b t ) — (3(S) — (3(S'), and b — J^t Let the original specification of the vertices be 
(c t ), and c = J2t c t- 

Fix some order of subtuples in S and S' so that the sizes of the i-th subtuple in S and S' are 
equal for any i. Denote this common value by v(i). Also, let 8 s (i) be 1 if the i-th subtuple of S is 
contained in A s , and otherwise. Define 8' for S' similarly. 

Let S be the set of possible sequences of how the subtuples could have been loaded. I.e., for 
each element of S, the first c subtuples have specification (c*), and the remaining b—c subtuples are 
in a non-decreasing order with respect to their sizes. Moreover, the order of the first c subtuples is 
irrelevant, i.e., no two distinct elements of £ have their tails of last b—c subtuples equal. In these 
notations, 

b ( 3-1 3-1 A _1 

Ps=PoYl II D »i°i) ( zJj =1 ^i( cr *)'---'E J=1 5 *>-i(<ri) J , (16) 

crgSj=c+l V ' / 

where p a and D are defined in Section [5.3l A similar expression works for 5" as well, if one replaces 
8 by 8'. 



19 



Since the distance between the types of S and S' is d, one can define the order of the subtuples 
so that 8 s (i) = S' s (i) for all s's and all, except at most O(d), i's. In this case, for all a, s and j: 



O(d). 



i=l i=l 



Then the ratio of the D's in (JTHJ) is at most 1 + 0(d/n). Since there are 0{rz) multipliers, the 
ratio of the products in (| 16[) for the same a is at most 



Finishing the proof Finally, we are about to prove that the statement of Corollary [161 applies 
for the flow. Call an arc on preparatory or last stage typical if the preceding key vertex is typical 
and the flow through the arc is non-zero. We show that conditions of © hold for a fixed value 
of x G / _1 (1). Then the existence of a strong equivalence between any two positive inputs, as 
in Section EOl implies that (JS| holds for all positive inputs x with the values of n(E) and t(E) 
independent on x. 

Note that the factor m / (l(f\N(S' \ Q)) from ([TU]) is equal for all arcs from a fixed equivalence 
class of the preparatory stage, up to a constant factor. Thus, the main concern is about Ps>\q, 
that is flow through a key vertex. The same is true for the last stage as well. 

We start with the third condition of ((HJ) . It is enough to show the flow differs by at most a 
constant factor for any two typical key vertices of the same specification. The latter follows from 
the fact the types of typical vertices are at distance 0{y/T\), and, hence, by Lemma [251 the ratio 
of the flow is e ^ 2 ^/") =0(1). 

We continue with the second condition. Again, it is enough to show its analog for key vertices. 
The latter is a direct consequence of Lemma [22] applied to the estimates of Lemmas [201 and [251 The 
constant C in the definition of the typical vertex is that from the last application of Lemma [22] 

Finally, let us calculate the speciality of each step. Because of Corollary [23 we may calculate 
the speciality as if the set of input variables is reduced to A>\ U M. Consider a typical arc e of 
step Let S be the origin of e. Note that S is typical (this is a consequence of (|14p). If 

I = 1 then we can add any element from an untouched tuple of A>$. Due to ([5]), there are f2(n) 
such elements. 

Now assume I > 1. By the construction of the flow, the non-zero flow is through the arcs that 
load the Z-th element for a subtuple from A>i, By Lemma [19j in S, there are f2(r/_i) (I — 1)- 
subtuples from A>i. In both cases, there are °,( r i-i) arcs leaving S that are used by the flow. By 
Lemma [201 an 0(1) fraction of all vertices is typical, hence, the speciality of an equivalence class 
of step is 0(n/n-i). 

For the last stage, the same argument as in Section [531 applies, concluded by a fact an 0(1) 
fraction of all vertices before the last stage is typical. 

Thus, the flow is almost symmetric and estimates from Table [1] are correct. This proves Theo- 
rem [T] 

6 Summary 

An algorithm for fc-distinctness problem is constructed in the paper, given the prior knowledge of 
the structure of the input. Is it true, the problem can be solved in the same number of queries 
without any prior knowledge? 




And the same estimate holds for the ratio of sums. 



□ 
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Also, the algorithm in Section I4T51 can be used for any function such that its 1-certificate com- 
plexity is bounded by k. For the algorithm in Section[5l it is not clear. So, another (stronger) open 
problem is as follows. Is it true, any function with 1-certificate complexity bounded by constant 
can be calculated in o(n 3//4 ) quantum queries? If so, this would be a far-reaching generalization of 
the quantum algorithm in |CK11) . 
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